Search results for "Differential item functioning"
showing 10 items of 29 documents
Validation of the Weight Bias Internalization Scale for Mainland Chinese Children and Adolescents
2021
Weight stigma internalization among adolescents across weight categories leads to adverse psychological consequences. This study aims to adapt and validate a Chinese version of the Weight Bias Internalization Scale for Mainland Chinese children and adolescents(C-WBIS). A total of 464 individuals aged 9 to 15 years participated in the present study. Based on item response theory (IRT) and classical test theory (CTT), we selected the items for the C-WBIS and evaluated its reliability and validity. The item response theory yields support for the one-dimensional factor mode. All item parameters fit the IRT model (albeit within an adequate range), eight items were adopted. No evidence of signifi…
The WHO-5 Well-Being Index – Validation based on item response theory and the analysis of measurement invariance across 35 countries.
2020
Abstract Background The five-item World Health Organization Well-Being Index (WHO-5) is a frequently used brief standard measure in large-scale cross-cultural clinical studies. Despite its frequent use, some psychometric questions remain that concern the choice of an adequate item response theory (IRT) model, the evaluation of reliability at important cutoff points, and most importantly the assessment of measurement invariance across countries. Methods Data from the 6th European Working Condition survey (2015) were used that collected nationally representative samples of employed and self-employed individuals (N = 43,469) via computer-aided personal interviews across 35 European countries. …
Using the Johnson-Neyman Procedure to Detect Item Bias in Personality Tests
2016
I present an alternative to a widely used item-bias analysis procedure known as the ANOVA method. The new method is based on the Johnson-Neyman procedure, which is a multiple regression-based technique with properties that can overcome the major weaknesses of the ANOVA method. I introduce the proposed procedure in a non-technical manner, provide an illustrative example, and suggest guidelines for empirical analysis that can be readily applied in personality research.
Item Response Theory Analysis of the Life Orientation Test-Revised: Age and Gender Differential Item Functioning Analyses
2015
This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items’ properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indic…
Gender-related differences in psychometric properties of WHO Disability Assessment Schedule 2.0
2019
Objective of this study was to investigate the gender-related differential item function of 12-item WHODAS 2.0 amongst patients with chronic musculoskeletal pain. This was a cross-sectional survey study among 1,988 patients at a university Physical and Rehabilitation Medicine outpatient clinic. To assess DIF, WHODAS 2.0 items were dichotomized as `none’ rated by respondents as `0’ versus `any limitation’ rated as `1,2,3 or 4’. The item response theory analysis was used to define discrimination and difficulty parameters of a questionnaire. The probit logistic regression was used to test uniformity of DIF between gender groups. The results of DIF analysis were presented and evaluated graphica…
International Education Studies: Increasing Their Linguistic Comparability by Developing Judgmental Reviews
2012
In international education studies, the different-language test versions need to be equally difficult to read and answer for the test to be valid. To ensure comparability, several quality control procedures have been developed. Among these, surprisingly little attention has been paid to judgmental reviews and their ability to identify language-related sources of bias. Also, the reviews have often failed in identifying biases. This paper explored whether it is possible to improve the ability of judgmental reviews to identify language-related sources of bias. A new review was made of two Finnish items which in the PISA (Programme for International Student Assessment) 2000 reading test showed …
Utility of the Mantel-Haenszel Procedure for Detecting Differential Item Functioning in Small Samples
2004
Sample-size restrictions limit the contingency table approaches based on asymptotic distributions, such as the Mantel-Haenszel (MH) procedure, for detecting differential item functioning (DIF) in many practical applications. Within this framework, the present study investigated the power and Type I error performance of empirical and inferential criteria for DIF detection in small samples. Sample sizes (50/50, 100/50, 200/50, and 100/100 for the reference and focal groups, respectively), ability distributions (equal and unequal), and amount of DIF (moderate and high) were manipulated. The results show the advantages of employing theMHchi-square statistic using high levels of significance (•…
Funcionamiento diferencial del ítem de la evaluación internacional PISA. Detección y comprensión
2014
Este trabajo analiza el funcionamiento diferencial del ítem (FDI) de la prueba de comprensión lectora de la evaluación PISA2000 entre la muestras del Reino Unido y España. Se estudian los ítems liberados con el fin de aunar las fases de detección del FDI con la comprensión de sus causas. En la fase de detección se comparan los resultados de los procedimientos Mantel-Haenszel, Regresión Logística y Medias Estandarizadas en sus versiones para ítems dicotómicos y politómicos. Los resultados muestran que dos ítems presentan funcionamiento diferencial aunque el estudio post-hoc llevado a cabo sobre su contenido no ha podido precisar sus causas.
Rasch Analysis of Authentic Evaluation of Young Children's Functioning in Classroom Routines.
2021
This study evaluated the functioning of children in early childhood education classroom routines, using the 3M Functioning in Preschool Routines Scale. A total of 366 children aged 36 to 70 months and 22 teachers from six early childhood education centers in Spain participated in the study. The authors used the Rasch model to determine the item fit and the difficulty of the items in relation to children's ability levels in this age range. The Rasch Differential Item Functioning (DIF) analysis by child age groups showed that the item difficulty differed according to the children's age and according to their levels of competence. The results of this study supported the reliability and validit…
Liberal and Conservative Differential Item Functioning Detection Using Mantel-Haenszel and SIBTEST: Implications for Type I and Type II Error Rates
2004
The aim of this work was to determine, in terms of Type I and Type II error rates, the risks of applying various statistical procedures for evaluating differential item functioning. To this end, the authors carried out a simulation study in which the Mantel-Haenszel and SIBTEST procedures were applied in conjunction. The variables manipulated were sample size and distribution of ability between groups. Results indicated that, although there was a high rate of agreement between the procedures, the joint Type I and Type II error rate may vary substantially from that obtained when each of the procedures was applied separately. Furthermore, the authors analyzed empirical data to obtain informat…